Bangya Liu

Dr. DocBench: A Comprehensive Benchmark for Expert-Level and Difficult Document Parsing

May 31, 2026

IntentionNav: A Benchmark for Intent-Driven Object Navigation from Implicit Human Instruction

May 22, 2026

Accelerating Transformer-Based Monocular SLAM via Geometric Utility Scoring

Apr 13, 2026

Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

Apr 06, 2026

SpatialStack: Layered Geometry-Language Fusion for 3D VLM Spatial Reasoning

Mar 28, 2026

Egocentric World Model for Photorealistic Hand-Object Interaction Synthesis

Mar 13, 2026

Learning Actionable Manipulation Recovery via Counterfactual Failure Synthesis

Mar 13, 2026

MV-S2V: Multi-View Subject-Consistent Video Generation

Jan 27, 2026

ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

Dec 28, 2025

CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization

Dec 22, 2025